ON PRIMITIVE DIVISORS OF n 2 + b 



GRAHAM EVEREST AND GLYN HARMAN 

Abstract. We study primitive divisors of terms of the sequence 
P n = n 2 + 6, for a fixed integer b which is not a negative square. It 
seems likely that the number of terms with a primitive divisor has 
a natural density. This seems to be a difficult problem. We survey 
some results about divisors of this sequence as well as provide upper 
and lower growth estimates for the number of terms which have a 
primitive divisor. 



1. Primitive prime divisors 

Given b, an integer which is not a negative square, consider the inte- 
ger sequence with nth term P n = n 2 +b. It seems likely [5] that infinitely 
many of the terms are prime but a proof seems elusive. Perhaps this 
mirrors the status of the Mersenne Prime Conjecture, which predicts 
that the sequence with nth term M n = 2 n — 1 contains infinitely many 
prime terms. At least with the Mersenne sequence, an old result shows 
that primes are produced in a less restrictive sense. 

Definition 1.1. Let (A n ) denote a sequence with integer terms. We 
say an integer d > 1 is a primitive divisor of A n if 

(1) d\A n and 

(2) gcd(<i, A m ) = 1 for all non-zero terms A m with m < n. 

In 1886 Bang [2] showed that if a is any fixed integer with a > 1 
then the sequence with nth term a n — 1 has a primitive divisor for any 
index n > 6. This is remarkable because the number 6 is uniform across 
all a and it is small. Before we say any more about polynomials, a short 
survey follows indicating the incredible influence of Bang's Theorem. 

1.1. Primitive divisor theorems. In 1892 Zsigmondy obtained the 
generalization that for any choice of a and b with a > b > 0, the 
term a n — b n has a primitive divisor for any index n > 6. This lovely 
result was re-discovered several times in the early 20th century and 
it has turned out to be quite applicable. See [2S] and the references 
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therein where applications to Group Theory are discussed. For ex- 
ample, the order of the group GL n (¥ q ) has a primitive divisor for all 
large n. Thus Sylow's Theorem can be invoked to deduce information 
about the structure of the group. 

The next major theoretical advance was made by Carmichael. Let u 
and v denote conjugate quadratic integers; in other words, zeros of a 
monic irreducible polynomial with integer coefficients. Consider the 
integer Lucas sequence defined by 

U n = (u n -v n )/(u-v). 

The Fibonacci sequence (F n ) arises from the roots of the polynomial 
x 2 — x — 1. Carmichael [5] showed that if u and v are real then U n 
has a primitive divisor for n > 12. This is a sharp result because F 12 
does not have a primitive divisor. Less is currently known about the 
corresponding Lehmer-Pierce sequence 

V n = (u n -l)(v n -l). 

Kalman Gyory pointed out to the first author that if uv — 1 then V n 
has a primitive divisor for all n beyond some (actually uniform) bound; 
on the other hand, if uv = — 1 then V2k does not have a primitive 
divisor if k is odd, because Vik = — V£. This second observation is 
actually quite germane to this paper; see Theorem 11.21 In fact the 
set of terms with a primitive divisor has natural density equal to | 
(cf. Conjecture II. 5p . At the conference, Richard Pinch remarked that 
certain Lehmer-Pierce sequences count orders of groups: this time the 
groups are E(¥ p n), where E denotes an elliptic curve. 

Bilu, Hanrot and Voutier [I] used powerful methods from Diophan- 
tine analysis to prove, in the general case, that U n has a primitive 
divisor for any n > 30. Again this is a sharp result as the sequence 
generated by the polynomial x 2 — x + 2 illustrates. Finally, Silverman 
[28] obtained a primitive divisor theorem for Elliptic Divisibility Se- 
quences and a uniform version appears in [12] for a certain class of 
sequences. 

1.2. Primitive divisors of n 2 + b. 

Theorem 1.2. Infinitely many terms of the sequence n 2 + b do not 
have a primitive divisor. 

The proof of Theorem 1 1 . 21 follows very easily from a result of Schinzel 
[26] and will be discussed shortly. Schinzel's proof manufactures a very 
thin set of terms with no primitive divisor. Dartyge [8] has improved 
Schinzel's result for n 2 + 1 (and in principle the method works for 
n 2 + b also). The aim of this paper to obtain a better grasp on the 
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set of terms with no primitive divisor. We will also consider whether 
the set of indices n for which P n has a primitive divisor has a natural 
density. Apparently this lies quite deep. 

For other interesting approaches to the study of divisors of quadratic 
integral polynomials; consult [i,[10],[n],[l8],[l9j,[23],[2l]and[30]. 
For higher order polynomials there is also the paper [9]. 

1.3. The greatest prime factor. Let P + [m) denote the greatest 
prime factor of the integer m > 1. There is a wealth of literature 
about P + {n 2 + b) concerned with the fact that P + (n 2 + b) — > oo as 
n — > oo, see j27J Chapter 7]. In a slightly different direction, Luca 
[22] has recently revived an old method of Lehmer's [21] to show that, 
given B, the set of indices for which P + (n 2 + 1) < B is efficiently 
computable. Carmichael's result mentioned earlier for Lucas sequences 
plays a key role. He illustrates his method by showing that when 
B = 101, n < 24208144. 

The following is an easy proposition, see [6] or [14] . which relates 
P + (n 2 + b) to the existence of a primitive divisor. 

Proposition 1.3. For alln > \b\, the term P n = n 2 + b has a primitive 
divisor if and only if P + (n 2 + b) > In. For all n > \b\, if P n has a 
primitive divisor then that primitive divisor is a prime and it is unique. 

Proof of Theorem M.B. Results of Schinzel fZE\ Th. 13] show that for 
any a > 0, P + (n 2 + b) is bounded above by n a for infinitely many n. 
Taking a = |, Proposition 11.31 shows that P n = n 2 + b fails to have a 
primitive divisor infinitely often. □ 

Given x > 1, Schinzel's method constructs fewer than logx terms 
P„ with n < x having no primitive divisor. For a > |||, Dartyge [8] 
showed that 

\{n < x : P + (n 2 + 1) < x a }\ > x. 

It should be noted that the implied constant is very small, involving, 
as it does, a term 2 _<5 where 5 "est extrement petit" [8j p. 3 line 10]. 
In this paper we prove the following, which provides good upper and 
lower estimates for the number of terms with a primitive divisor. 

Theorem 1.4. Supposing —b is not an integer square, define 

Pb(x) = | {n < x : n 2 + b has a primitive divisor }| . 

For all sufficiently large x we have 

Pb(x) 

0.5324 < f-^-L < 0.905. 

x 
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1.4. Natural density. Integers m with the property P + (m) > 2y/m 
were studied by Chowla and Todd [5|. They proved that the set of these 
numbers has natural density log 2. Perhaps this suggests the following: 

Conjecture 1.5. If —b is not an integer square then pb{x) ~ xlog2. 

With the availability and power of modern computers, one would 
usually resort to some computational evidence in support of such a 
conjecture. The authors of [H] looked for such evidence. Whilst they 
found nothing to clearly contradict the conjecture, neither did they 
find overwhelming evidence to support it. The problem is that the 
convergence to the natural density is very slow. 

The reason for this might best be explained as follows. Chowla and 
Todd's proof uses Mertens' Theorem about the asymptotic formula for 
the sum of inverse primes: 

V - = loglogx + C + O ( — !— 
^— ' p \ log x 

The main term of this formula grows very slowly and the error term 
shrinks very slowly as well. Perhaps, somehow, this lies behind the 
extremely slow convergence to the natural density of terms with prim- 
itive divisor, as in Conjecture 11.51 In addition, the arithmetical nature 
of the sequence n 2 + b plays a significant role when discussing its very 
large prime divisors (see (II ip below) and this will affect what happens 
for 'small' x. Our paper concludes with an explanation as to why we 
are not holding our breath about a proof of Conjecture 11.51 

2. Simple bounds 

The article [H] gives some simple estimates for pb{x) which are 
sketched below. These are recalled here as a way in to the harder meth- 
ods. The first bound in counts indices which produce no primitive 
divisor. It is much better than the bound obtained from [26] but the 
set of indices still has density zero and perhaps indicates the limit of 
elementary methods. The second bound in (TjQ) is very easy but already 
gives a good estimate for the density of terms with a primitive divisor 
if it exists. 

Theorem 2.1. For all sufficiently large x, 

<Ci- pb{x) and \x — p b (x) <C ■ (1) 



log x log X 

The proofs use little apart from well-known estimates for sums over 
primes, which can be found in the book of Apostol pQ. Both begin with 
an old idea of Chebychev which is used frequently as the starting point 
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of investigating the greatest prime factor of certain sequences (see [2Ql 
Chapter 2] for example). 

Apart from a finite number of primes, any prime p that divides n 2 + b 
has the property that —b is a quadratic residue modulo p. Let 1Z denote 
the set of odd primes for which —b is a quadratic residue; notice that 1Z 
comprises the intersection of a finite union of arithmetic progressions 
with the set of primes. Write 

X 

Qx J^J I P-n | 

n=l 

and denote by u)(Q x ) the number of prime divisors of Q x . By Propo- 
sition CL3] it is sufficient to bound u(Q x ) because, with finitely many 
exceptions, a primitive divisor is unique. 

2.1. Sketch proof of Theorem 12.11 Define 

S = {p E 1Z : p\Q x ,p < 2x} and S' = {p G TZ : p\Q x ,P > 2x). 

Let s = \S\ and s' = \S'\. We seek bounds for s + s'. By Dirichlet's The- 
orem on primes in arithmetic progression it is sufficient to estimate s'. 
Following Chebychev's method, use Stirling's Formula to obtain 

^2e p logp = \ogQ x = 2x\ogx + 0(x) (2) 
p\Qx 

where the left-hand side corresponds to the prime decomposition of Q x , 
for positive integers e p . The sum on the left-hand side of (j2J) decom- 
poses according to the definitions of S and S' to give 

^2e p \ogp+ J^logp = logQz, (3) 

pes pes' 

noting that e p — 1 whenever p > 2x. It is easy to show that 

e p logp = x logx + 0(x). (4) 

pes 

Combining (j2J), (jSI) and (j4]) gives 

x log x + O(x) = logp. (5) 

The right hand side is bounded above by s' log(x 2 + 1) yielding a lower 
bound for s'. 
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The second bound in (OQ) arises similarly using a finer partition of 
the set S' 

T = {p e TZ : p\Q x , 2x < p < Kx}; 
U = {peTZ \p\Q x ,Kx <p}, 

for K > 2. Write t = \T\ and u = \U\ then we seek an upper bound for 
expression t+u. Using the definitions of T and U as well as equation (jl]) 
shows that 

^^logp + ^^logp = a; log x + 0(x). 

The extra leverage comes because the left-hand side is greater than 

t log x + u \og(Kx) . 

Now K can be chosen judiciously to beat the other O-constants. An 
upper bound for t + u follows easily and hence the second bound in (CQ) . 

Note Actually K can be taken as large as logo; which yields 

x log log X , . 

— < x - p b {x) 

logx 

for all large x. But this still fails to produce a positive density set. 

3. Better bounding 

It is the aim of this section to prove Theorem 11.41 Take 6=1 
for simplicity, so we can drop the subscript b on p; as with [10] the 
arguments in [20] can be used to generalise to b ^ 1. We then prove 
the following. 

Theorem 3.1. For all sufficiently large x we have 

0.5324 < < 0.905. 

x 

Let 

NM= E L 

x<n<2x 
p\n 2 +l 

The previous section shows it is sufficient to estimate 

E N M- 

p>2x 

Re-casting (jSJ) using this definition: 

N x {p) \ogp = xlogx +0(x). (6) 

p>2x 



ON PRIMITIVE DIVISORS OF n 2 + b 



7 



The extreme cases arise if most of the contribution to this sum comes 
from p around 2x in size, or around 4x 2 in size. In the former case the 
bound logp > logx gives the trivial bound 

p>2x 

which is weaker than the first bound of the last section. On the other 
hand, logp < 2\ogx + 0(1) gives 

Y,N x (p)>±x + o(x), (8) 

p>2x 

which is essentially the second bound of the last section. 

We could obtain improved results if we had better information about 
the following expression: 

v x (v)= J2 N M- 

v<p<ev 

It is a good exercise to show that 

V,{v) ~ ^- (9) 
log V 

implies the conjecture. Unfortunately, the asymptotic formula (Q is 
not expected to be true for very large v , in view of the arithmetic nature 
of n 2 + 1 (see below). However, it is expected that (Q will be true for 
v < x 2 ~ e for any e > and this suffices to prove the conjecture. 

3.1. A better upper bound for p(x). We begin by modifying the 
definitions to allow us to use the Deshouillers-Iwaniec method in [TO] . 
To be precise we must use smooth functions in order to apply the mean- 
value estimates in [10] for Kloostermann sums. Let e, rj be two small 
positive quantities. Let b(u) be a function satisfying b{u) G [0, 1] for 
all «6R, with 



b(u) 
d r b(u) 



1 if (1 + e)x < u < (2 - e)x 
if u < x or u > 2x, 

•C rf u~ r for all r G N. 



du r 

We redefine N x (p) to be 

N X (P)= E b(n). 



x<n<2x 
p\n 2 +l 
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An upper bound for this summed over p will give us an upper bound 
for the original problem, since the two quantities will differ by at most 

§ ex. 



p2x 

X = b(u)du, \A d \ = b ( 7 

_a , 1— n/ J J\ 



Now write 

-2r 

' ^ '(//) 

n 2 +l=0 (modd) 

By the working on [TQl P-2] we can modify the Chebychev argument to 
give 

\Ap\ hgp = 2Xlogx + 0(x). 

p 

Also, as shown in [10], we have 

^2 \A P \ = Xlogx + 0(x). 

p<x 

Let 

P x = max p = x a say. 

We therefore have 

\A P \ \ogp = X log x + O(x). 

x<p<P x 

Deshouillers and Iwaniec then estimate this sum as 

1<3<J 

where Vj = Tx and 

S(x,V 3 )= C M^gp. 

V j<P <AV 3 

Here the infinitely different iable functions Cj(u) G [0, 1] are supported 
in [Vj^Vj), with 



1 if 2x < u < P x 
if u < x or u > P x . 



1<3<J 

After several transformations and an application of the Rosser-Iwaniec 
sieve in tandem with their own sophisticated mean-value estimate for 
averages of Kloostermann sums, they prove that 

Vi) 5 ids: / ^«>^ rf « f 1 + (i^)) • ^ 
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Here Dj = x 1 v Vj 2 . From this they deduce that a is not less than the 
solution to 

2-a-21og(2-a) = f. 

That is, a = 1.202468. . .. 

Now, the worst case scenario for the upper bound (jTj) is if (flO]) holds 
with equality for each Vj. This gives 

X>»(p)< E S(X,V j )QogV j )- l + 0(xQogx)- 1 ) 

p>2x l<j<J 

= x(l + 0(\ogx)- 1 ) J Y~tj2 dt 

= (2a - \)x (1 + O(ilogx)- 1 )) < 0.905a;. 

3.2. A better lower bound for p(x). Now we need to show that not 
all the contribution comes from primes near x 2 . This is a relatively 
simple application of an upper bound sieve to the set 

{m : m£ = n 2 + b, x < n < 2x} for £ in some range. 

In this case we can apply the sieve with distribution level 

x 

Dp 



£(logx) 



for some A, by an elementary argument: this corresponds to Vj/x in 
the last section. Of course, this is why the elementary argument is 
no good for Vj near x in size. The crossover point between the two 

4 

methods is at Vj — xs 7 but we can get nowhere near this value for the 
problem discussed in [10]. For I = 1 the problem is the well-known 
one of representing almost-primes by values of n 2 + 1 and giving an 
upper bound for the number of prime values of this polynomial. By 
[TBI Theorem 5.3] (or see [151 p. 66]) we have 

T i < ^- TT fi - ^ fi + o f lQ f og3x )) ■ (ii) 

n 2 +l=p 

Here x( n ) is the non-trivial character (mod 4). Note the important 
product over primes above which encodes arithmetical information rel- 
evant to the polynomial n 2 + l. This did not arise in the previous section 
since summing over a sufficiently long range for £ smooths out this fac- 
tor (compare [TQ1 §8]). It is expected that ffTTl) holds with equality if 
the factor 2 is replaced by | on the right-hand side, see [TFl E] . 
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We obtain our desired bound by first considering 

W(L,x)= EE 1 - 



L<£<eL x<n<2x 



Write u(d) for the number of solutions to n 2 + 1 = (mode?) and let 
{^d}d<D be the Rosser upper bound sieve of level D L = x(L(loga;) A ) _1 
as described in [TOl §4] and explicitly constructed in |T5l Chapter 4]. 
We then have 



w(L, x )< E E Arf E 1 

L<£<eL d<D L x<n<2x 

n 2 +1=0 (mod dt) 

E E A ^)(i+°( 1 ) 



L<£<eL d<D L 

U)(d£) 



x E E Xd ^nr + {LD L (hgx) 2 ) 



L<i<eL d<D L 

In the above we have noted that u(di) < r{d)r{€) and used the well- 
known average value of the divisor function r{n) to give a bound for 
the error term. We then use a similar analysis to that in [TUl §8] to 
produce the 'main term'. We give all the details that differ from [TU] 
here for completeness. 
Firstly write 

L<£<eLd<D L d<D L 

Now put 

uj{dm) 



L(s,d) = J2 



uj(d)m s 
Note by [TOj, Lemma 4] that 



P \d 

Using Perron's formula ([291 Theorem 3.12]) with T = x,c = (logs)" 1 
we have 

i pc+ix (pT\ s T s / 

J{d, L) = / L{s + 1, d)^ — ds + O (a;-3 

2m J c _ ix s V 

The final term is negligible. (Actually, this has been estimated very 
crudely, in reality it is 0(x e_1 )). Now take the contour of integration 
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The pair of integrals on Ims = ±x give a negligible contribution 

2 

((^(a; 6 ^)), using 

max(|C(s)|, \L(s,x)\) < for 1 < |Ims| < T, Res > \. 
The integral on the new contour can be estimated using: 



IC(£ + **)!' 
1 + 1*1 



dt < (logT) s 



with the same bound applying when ((s) is replaced by L(s,x), to- 
gether with ([29, p.135]) 

1 



ai+it) 



and 



,s+l 



< logT for t < T, 



p\d x ^ 



< r(d). 



This gives a bound for the integral which is 

r(d)(\ogx) 3 



Thus 

E E A - 

L<£<eL d<D L 

Here 



u(d£) 



E A 



w'(d) 



<i<D L 



o;(d)r(d)(logx) 



u/(d) = us 



p\d 



The rest of the working follows mutatis mutandis from [TUl p. 10]. Hence 



E Ea 



2.r 



log^L 



1+0 



log^ 



■O 



x(logx) 



L<£<eL d<D L 

The reader can thus see that the extra error term 0{x{\.ogx) 7 L~^) (the 
log power could be reduced here by more careful working) corresponds 
to the averaging over i smoothing out the influence of the product in 
(II ip . and this must dominate the main term for small L since the 'main 
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term' will be incorrect in this case. Assuming that Dl = xL 1 (logx) 4 
and x*>L> (log x) 18 we obtain 

W(N, L) < + O (yr^) ■ (12) 

\ogD L \{]ogx) 2 J 

For L < (logo;) 18 we can establish a slightly cruder upper bound 
as follows. For each value of £ we do not sieve by primes dividing £. 
This makes the A^ depend on £, but we have A^ = if (d,£) > 1. 
Hence we can write u(d£) = oj{d)uj{£). Following the analysis above, 
the remainder term remains 0(LD ^(logx) 2 ). The 'main term' for the 
upper bound is now 

2x oj(£) -n- x(v) \ . Kx 



\ogD L ^ 6(e) V p-lj ~ \ogD L 

for some absolute constant K. The contribution from the terms with 
L < (logx) 18 is thus (loglogx)(logx) _1 times the total contribution 
for larger L. These terms may therefore be neglected asymptotically. 
Now the worst case scenario for ([7]) has equality in ( fT2l) for 

r-2 



x 2 - 



/ 2t 
dt = 1. 
t — 1 



In other words, 9 is the solution to 

2(2-0)-21og(0-l) = 1. 
This is the limit of the sequence 

ai = 2, a n+ i = i + a n - log(a n - 1)^ (n > 1), 

quickly giving the value 1.766249 . . .. We then calculate 
. 2 2 



dt = 29-3> 0.5324 . . . 



4. Some implications of Conjecture 11.51 

The following argument shows that we do not expect Conjecture 11.51 
to be settled in the near future. In the previous section we have used 
the tools that have been developed for the investigation of the greatest 
prime factor of n 2 + 1 to obtain (rather weak) approximations to the 
conjecture. Now we assume the conjecture and demonstrate that it 
would lead to a phenomenal improvement for the greatest prime factor 
problem. 
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The conjecture leads to 

p>2x 

By the Chebychev argument ([6]), on average in these sums, 
lose v I 

~ = ° ( sa y) =i-44i6... 

log x log 2 

Hence the greatest prime factor of n 2 + 1 infinitely often exceeds n CT . 
This more than doubles the improvement of Deshouillers-Iwaniec over 
the trivial estimate! However, we can do still better using the elemen- 
tary bound from the last section. The worst case scenario now has all 
the contribution to the left hand side of ($5$) coming from p close to x° . 
Since the bounds of the last section must hold (and they are better than 
the Deshouillers-Iwaniec estimates in this region), this corresponds to 
finding a < a < (3 with 

dt = \og2, / -dt = l. 



t-l J a t 

A little bit of manipulation gives the solution to be 

P = 1 + 1 ~ l0g t 2 = 1.52383 . . . (13) 

2-y/2 

This gives the following result. 

Theorem 4.1. If Conjecture 1 1 . 51 is true, then infinitely often the great- 
est prime factor of n 2 + 1 exceeds vP where f3 is given by ( TT3l) . 
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